Using LazyBoosting for Word Sense Disambiguation
نویسندگان
چکیده
This paper describes the architecture and results of the TALP system presented at the SENSEVAL-2 exercise for the English lexical–sample task. This system is based on the LazyBoosting algorithm for Word Sense Disambiguation (Escudero et al., 2000), and incorporates some improvements and adaptations to this task. The evaluation reported here includes an analysis of the contribution of each component to the overall system performance. 1 System Description The TALP system has been developed on the basis of LazyBoosting (Escudero et al., 2000), a boosting–based approach for Word Sense Disambiguation. In order to better fit the SENSEVAL2 domain, some improvements have been made on the basic system, including: features that take into account domain information, an specific treatment of multiwords, and a hierarchical decomposition of the multiclass classification problem, similar to that of (Yarowsky, 2000). All these issues will be briefly described in the following sections.
منابع مشابه
An Empirical Study of the Domain Dependence of Supervised Word Sense Disambiguation Systems
This paper describes a set of experiments carried out to explore the domain dependence of alternative supervised Word Sense Disambiguation algorithms. The aim of the work is threefold: studying the performance of these algorithms when tested on a di erent corpus from that they were trained on; exploring their ability to tune to new domains, and demonstrating empirically that the LazyBoosting al...
متن کاملOn the Portability and Tuning of Supervised Word Sense Disambiguation Systems
This report describes a set of experiments carried out to explore the portability of alternative supervised Word Sense Disambiguation algorithms. The aim of the work is threefold: rstly, studying the performance of these algorithms when tested on a diierent corpus from that they were trained on; secondly, exploring their ability to tune to new domains, and thirdly, demonstrating empirically tha...
متن کاملBoosting Applied toe Word Sense Disambiguation
In this paper Schapire and Singer’s AdaBoost.MH boosting algorithm is applied to the Word Sense Disambiguation (WSD) problem. Initial experiments on a set of 15 selected polysemous words show that the boosting approach surpasses Naive Bayes and Exemplar–based approaches, which represent state–of–the–art accuracy on supervised WSD. In order to make boosting practical for a real learning domain o...
متن کاملA Comparison between Supervised Learning Algorithms for Word Sense Disambiguation
This paper describes a set of comparative experiments, including cross{corpus evaluation, between ve alternative algorithms for supervised Word Sense Disambiguation (WSD), namely Naive Bayes, Exemplar-based learning, SNoW, Decision Lists, and Boosting. Two main conclusions can be drawn: 1) The LazyBoosting algorithm outperforms the other four state-of-theart algorithms in terms of accuracy and ...
متن کاملرفع ابهام معنایی واژگان مبهم فارسی با مدل موضوعی LDA
Word sense disambiguation is the task of identifying the correct sense for the word in a given context among a finite set of possible sense. In this paper a model for farsi word sense disambiguation is presented. The model use two group of features: first, all word and stop words around target word and topic models as second features. We extract topics from a farsi corpus with Latent Dirichlet ...
متن کامل